wo 2005/008596 



39 



PCT/EP2004/051190 



1. An Algorithm for projecting information data 
belonging to a multidimensional space into a space 

5 having less dimensions coxqprising the following steps: 

Providing a database of N-dimensional data in the 
form of records having a certain number of variables. 

Defining a metric function for calculating a 
distance between each record of the database. 
10 Calculating a matrix of distances between each 

record of the database by means of the metric function 
defined at the previous step 

Defining a n-1 dimensional space in which each 
record is defined by n-1 coordinates. 
15 Calculating the n-1 coordinates of each record in 

the n-1 dimensional space by means of an evolutionary 
algorithm; 

Defining as the best projection of the records 
onto the n-1 dimensional space the projection in which 
20 the distance matrix of the records in the n-1 
dimensional space best fits or has minimum differences 
with the distance matrix of the records calculated in 
the n-dimensional space. 

2. An algorithm according to claim 1 in which a 
25 database is provided in which the distances between the 

records are already contained. 

3. An algorithm according to claim 1 or 2, 
characterised in that as an evolutionary algorithm a so 
called genetic algorithms is used. 

30 4. An algorithm according to one or more of the 

preceding claims^ characterised by the following steps: 

encoding each individual record or variable 
represented by a point having coordinate X and Y; 
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defining a set of different X and Y coordinates 
for each point forming a first population of 
projections solution onto the less dimensional space ^ 
usually a two or three dimensional space. 
5 Calculating the fitness score for each of the 

projections of this first population by using as the 
fitness function the matrix of distances of the single 
points in the originally N dimensional spacer- 
Subjecting the population of projections to 

10 combination according to certain combination rules thus 
producing a first generation population of projections 
which conqprises X and Y coordinates for the points 
which are a coinbination of the coordinates provided in 
two projections of the parent generation; 

15 Calculating the fitness score of the projections 

of the first generation and forming again a new 
generation basing on the first generation. 

5. An Algorithm according to one or more of the 
preceding claims ^ characterised in that the genetic 

20 algorithm is the so called GenD algorithm. 

6. An algorithm according to one or more of the 
preceding claims, characterised in that a hidden point 
can be defined, which corresponds to a hidden record or 
a to a hidden variable and whose existence is only 

25 guessed, the said hidden point is added in the parent 
population by giving to it position coordinates Xhi and 
Yhi in the projection. 

7. An algorithm according to claim 6, 
characterised in that the calculation of the 

30 evolutionary algorithm is carried out in parallel with 
the hidden point and without the hidden point and the 
best fit projections obtained by the two parallel 
calculations is coxqpared. 
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8. An algorithm according to one or more of the 
preceding claims characterised by the further steps of 
providing a database cosqprising a certain number of 
records each one characterised by a certain number of 

5 variables , 

elaborating the database alternatively or in 
parallel according to two ways: 

a first way by which the records are considerd as 
being points and the variables as being the coordinates 
10 of the points. 

A second way by which the variables are 
considered as being points and the records are the 
coordinates . 

9. An algorithm according to one or more of the 
15 preceding claims, comprising a further different 

algorithm treating the database as a pre or post 

processing phase. 

10 An algorithm according to claim 9^ 

characterised in that the database is processed in a 
20 preventive stage by means of a Self Organising Map 

algorithm, the clusters formed by this algorithm in the 

different units being subjected to a projection by 

means of the algorithm according to one or more of the 

proceeding claims. 
25 11. A method for the cognitive analysis of 

multidimensional information data conqprising the 

following steps: 

providing a database with a certain number of 

records each one comprising a certain nuniber of 
30 variables and which are relative to a N-dimensional 

space ; 

projecting the database considering the record as 
points and the variables as coordinates or the 
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variables as points and the records as coordinates onto 
a space having a reduced number of dimension relatively 
to the space N dimensional space; 

the projection being carried out by means o£ an 
5 Algorithm for projecting information data belonging to 
a multidimensional space into a space having less 
dimensions comprising the following steps: 

Calculating a matrix of distances between each 
point defined by a record or a variable of the database 
10 by means of a metric function; 

Defining a n-l dimensional space in which each 
point represented by a record or a variable is defined 
by n-1 coordinates. 

Calculating the n-l coordinates of each point in 
15 the n-1 dimensional space by means of an evolutionary 
algorithm; 

Defining as the best projection of the points onto 
the n-1 dimensional space the projection in which the 
distance matrix of the points in the n-1 dimensional 
20 space best fits or has minimum differences with the 
distance matrix of the points calculated in the n- 
dimensional space. 

12. A method according to claim 11 characterised 
in that a database is provided in which the distances 

25 between the records are already contained. 

13. A method according to claims 11 or 12 ^ 
characterised in that as an evolutionary algorithm a so 
called genetic algorithms is used. 

14. A method according to one or more of the 
30 preceding claims 11 to 13, characterised by an 

algorithm executing the following steps: 

encoding each individual record or variable 
represented by a point having coordinate X and Y; 
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defining a set of different X and Y coordinates 
for each point forming a first population of 
projections solution onto the less dimensional space ^ 
usually a two or three dimensional space. 
5 Calculating the fitness score for each of the 

projections of this first population by using as the 
fitness function the matrix of distances of the single 
points in the originally N dimensional space; 

Subjecting the population of projections to 

10 combination according to certain combination rules thus 
producing a first generation population of projections 
which comprises X and Y coordinates for the points 
which are a cooibination of the coordinates provided in 
two projections of the parent generation; 

15 Calculating the fitness score of the projections 

of the first generation and forming again a new 
generation basing on the first generation. 

15. A method according to one or more of the 
preceding claims 11 to 14, characterised in that the 

20 evolutionary algorithm is the so called GenD algorithm. 

16. A method according to one or more of the 
preceding claims 11 to 15, characterised in that a 
hidden point r^resented by a hidden record or a hidden 
variable can be defined, which corresponds to a hidden 

25 point on the map and whose existence is only guessed, 
the said hidden point being added in the parent 
population by giving to its position coordinates Xhi and 
Yhi in the projection. 

17. A method according to claim 16, characterised 
30 in that the calculation of the evolutionary algforithm 

is carried out in parallel with the hidden point and 
without the hidden point and the best fit projections 
obtained by the two parallel calculations is compared. 
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18. A method according to one or more of the 
preceding claims characterised by the further steps o£ 
providing a database con^rising a certain nuinber of 
records each one characterised by a certain nuniber of 

5 variables , 

elaborating the database alternatively or in 
parallel according to two ways: 

a first way by which the records are considered 
as being points and the variables as being the 
10 coordinates of the points. 

A second way by which the variables are 
considered as being points and the records are the 
coordinates . 

19. A method according to one or more of the 
15 preceding claims 11 to 18, conqprising a further 

different algorithm treating the database as a pre or 
post processing phase. 

20 A method according to claim 19^ characterised 
in that the database is processed in a preventive stage 
20 by meajis of a Self Organising Map algorithm^ the 
clusters formed by this algorithm in the different 
units being subjected to a projection by means of the 
algorithm according to one or more of the proceeding 
claims. 

25 21. A method according to one or more of the 

preceding claims 11 to 20, characterised in that the 
clustering or distamce of the points on the map on 
which the database has been projected is used as a 
measure of similarity of the records or of the 

30 variables related to the said point. 

22. A method according to one or more of the 
preceding claims, characterised in that a database 
comprising a certain number of records each one being 
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related to a certain nuniber of variables is provided 
to the said database being further added the 
complementary variables to the variables originally 
provided and the said integrated database being 
5 siibjected to projection onto a less dimensional space ^ 
particularly on a two or three dimensional space; 

The distance in the map between each variable and 
its complementary variable being used as a measure for 
the relevance of said variable in the database. 

10 23 A method according to one or more of the 

preceding claims^ characterised in that it is a method 
for evaluating the relevance of certain variables in 
determining a certain pathological status of 
individuals and a method for defining prototypes of 

15 individuals relatively to the variables of the database 
and their probability of developing a certain disease. 

24. A method according to one or more of the 
preceding claims characterised in that it is a method 
for analysing the probability of an individual of 

20 having or of developing the Alzheimer disease. 

25. A method according to one or more of the 
preceding claims 11 to 27 , characterised in that it is 
in the form of a program saved on a removable support. 

26 An Algorithm according to one or more of the 
25 preceding claims 1 to 10^ characterised in that it is 
in the form of a program saved on a removable support. 

27. A method according to one or more of the 
preceding claims 11 to 22, characterised in that it is 
a method for generating two dimensional maps of 

30 geographic sites starting form a database comprising 
relative distances of the sites. 

28. A method for generating two or three 
dimensional maps of geographic sites starting form a 
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database coxnprising relative distances of the sites, 
the method comprising the following steps of: 

a) organizing the known or measured distance 
values of the geographical sites in a matrix form; 
5 b) defining a two or a three dimensional space in 

which the position of each site is univoguely defined 
by two or three coordinates; 

c) determining the two or three coordinates of the 
position in the two or three dimensional space of each 

10 geographical site by means of an evolutionary 
algorithm; 

d) determining the distances of the geographical 
sites one form the other by means to the calculated two 
or three dimensional coordinates of position of the 

15 said geographical sites one from the other; 

e) generating a matrix of distances with the 
distances values which has been determined according to 
step d) ; 

f) defining as the best two or three dimensional 
20 coordinates of position of the totality of the 

geographical sites in the two or three dimensional 
space, the two or three dimensional coordinates of 
position of the said geographical sites for which the 
distance matrix determined therefrom best fits or has a 
25 minimum difference with the distance matrix of the 
known or measured distance values of the geographical 
sites . 

29. A method according to claims 27 or 28 
characterized in that the evolutionary algorithm is a 

30 so called genetic algorithm. 

30. A method according to one or more of the 
preceding claims 27 to 29, characterized in that 
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a) for each geographical site a first and a second 
set of coordinates defining the position of each 
geographical site in the two or three dimensional space 
is calculated; 

5 b) calculating the fitness score of the matrix of 

distances among the geographical sites determined by 
means of the first and second set of coordinates 
defining the position of each geographical site in the 
two or three dimensional space ^ by using as a fitness 
10 function the matrix of the known or measured distances 
of the geographical sites; 

c) subjecting the first and second set of 
coordinates of position of each geographical site to 
coxhbination for each geographical site according to 

15 predetermined combination rules ^ thus producing at 
least a new first and second set of coordinates of 
position for each geographical site; 

d) calculating the fitness score of the said new 
first and second set of coordinates of position 

20 according to step b) ; 

e) subjecting the said new first and second set of 
coordinates of position again to combination according 
to step c) and repeating the said steps c to e) until 
at least one new first or second set of coordinates of 

25 position reaches a maximum fitness score or is greater 
than a minimum predefined fitness score. 

31. A method according to claim 30, characterized 
in that for each combination steps of at least one 
first and one second set of coordinates several new 

30 sets of coordinates are obtained by the combinations of 
the said at least first and second set of coordinates. 

32 . A method according to one or more of the 
preceding claims characterized in that at least one 
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hidden or hypothetic geographical site is added to the 
database of geographical sites of which neither the 
coordinates nor the distances are known and a first and 
a second set: of coordinates for the said at least one 
5 geographical site being freely defined. 

33. A method according to claim 32, characterised 
in that the calculation of the evolutionary algorithm 
is carried out in parallel for the database provided 
with the hidden or hypothetic geographical site and 

10 without the hidden or hypothetic geographical site and 
the best fit set of coordinates of position of the 
totality of the geographical sites obtained by the two 
parallel calculations are compared. 

34. A method according to one or more of the 
15 preceding claims 27 to 33, characterised in that a 

further different pre or post processing phase is 
provided. 

35. A method according to claim 34, characterised 
in that in a preventive phase the known distance data 

20 matrix is subjected to treatment by means of a Self 
Organising Map algorithm, the clusters formed by this 
algorithm in the different units being then subjected 
to the method steps according to one or m ore of the 
preceding claims 27 to 34. 

25 36. A method according to one or more of the 

preceding claims 11 to 22, characterised in that it is 
a method for representing the structure of a molecule 
onto a three dimensional or a two dimensional steps by 
indicating only the relative distances of the atoms of 

30 the molecule. 

37. A method for representing the structure of a 
molecule onto a three dimensional or a two dimensional 
steps by indicating only the relative distances of at 
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least part of the atoms o£ the molecule relatively one 
to the other, characterised in that it comprises the 
following steps: 

a) organizing the known or measured distance 
5 values of the atoms in a matrix form; 

b) defining a two or a three dimensional space in 
which the position of each atom is univoquely defined 
by two or three coordinates; 

c) determining the two or three coordinates of the 
10 position in the two or three dimensional space of each 

atom by means of an evolutionary algorithm; 

d) determining the distances of the atoms one form 
the other by means to the calculated two or three 
dimensional coordinates of position of the said atoms 

15 one from the other; 

e) generating a matrix of distances with the 
distances values which has been determined according to 
step d) ; 

f) defining as the best two or three dimensional 
20 coordinates of position of the totality of the atoms in 

the two or three dimensional space, the two or three 
dimensional coordinates of position of the said atoms 
for which the distance matrix determined therefrom best 
fits or has a minimum difference with the distance 
25 matrix of the known or measured distance values of the 
atoms. 

38. A method according to claims 36 or 37 
characterized in that the evolutionary algorithm is a 
so called genetic algorithm. 
30 39. A method according to one or more of the 

preceding claixns 36 to 38, characterized in that 
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a) for each atom a first and a second set of 
coordinates defining the position of atom in the two or 
three dimensional space is calculated; 

b) calculating the fitness score of the matrix of 
S distances among the atoms determined by means of the 

first and second set of coordinates defining the 
position of each atom in the two or three dimensional 
space, by using as a fitness function the matrix of the 
known or measured distances of the atoms; 

10 c) subjecting ttie first and second set of 

coordinates of position of each atom to combination for 
atom according to predetermined combination rules, thus 
producing at least a new first and second set of 
coordinates of position for eacdi atom; 

15 d) calculating the fitness score of the said new 

first and second set of coordinates of position 
according to step b) ; 

e) subjecting the said new first and second set of 
coordinates of position again to combination according 

20 to step c) and repeating the said steps c to e) until 
at least one new first or second set of coordinates of 
position reaches a maximum fitness score or is greater 
than a minimum predefined fitness score. 

40. A method according to claim 39, characterized 
25 in that for each coinbination st^s of at least one 

first and one second set of coordinates several new 
sets of coordinates are obtained by the cosibinations of 
the said at least first and second set of coordinates. 

41 . A method according to one or more of the 
30 preceding claims 36 to 40 characterized in that at 

least one hidden or hypothetic atom is added to the 
database of geographical sites of which neither the 
coordinates nor the distances are known and a first and 
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a second set of coordinates for the said at least one 
atom being freely defined. 

42. A method according to claim 41, characterised 
in that the calculation of the evolutionary algorithm 

5 is carried out in parallel for the database provided 
with the at least one hidden or hypothetic atom and 
without the at least one hidden or hj^othetic atom and 
the best fit set of coordinates of position of the 
totality of the atoms obtained by the two parallel 
10 calculations are compared. 

43. A method according to one or more of the 
preceding claims 36 to 42, characterised in that a 
further different pre or post processing phase is 
provided. 

15 44. A method according to claim 43, characterised 

in that in a preventive phase the known distance data 
matrix is subjected to treatment by means of a Self 
Organising Map algorithm, the clusters formed by this 
algorithm in the different units being then sxabjected 

20 to the method steps according to one or m ore of the 
preceding claims 36 to 43. 

45. A method according to one or more of the 
preceding claims 41 to 44, characterised in that it is 
a method for finding out the presence and/or the 

25 position of at least an unknown or hidden atom in the 
structure of the molecule. 

46. An apparatus having artificial intelligence, 
the said apparatus being provided with a processing 
unit (1) , the said processing unit being connected to a 

30 data memory (3) and to a program memory (2) ; 

The processing unit being further connected to one 
or more different sensors for detecting or measuring 
different physical and/or chemical conditions or 
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effects or processes characterising or occurring in the 
environment ; 

The processing unit further being connected to 
data input means by means of a aervxcB person or a data 
5 input line from other data collecting apparati; 

The processing unit being also connected to means 
for carrying out mechanical^ physical or chemical 
actions, such as actuators or the like; 

in the program memory a program executable by the 
10 processing unit being loaded which program has a 
routine for driving the sensors and saving in a 
univoquely recognizable way each data collected by the 
sensors and or for saving data inputted by a semrice 
person or by other apparati, driver for activating or 
15 deactivating the means for carrying out mechanical, 
physical or che m ical actions, such as actuators or the 
like; 

the program stored in the program memory further 
comprising means for evaluating the data collected by 

20 the sensors and/or the data inputted by a service 
person or by other apparati; 
characterised in that 

the program having a sxibroutine for executing an 
algorithm according to one or more of the preceding 

25 claims 1 to 11 on the collected and/or inputted data. 

47. An apparatus according to claim 46^ 
characterised in that the program is provided with a 
subroutine for carrying out a cognitive analysis of 
multidimensional data by treating the said data with a 

30 method for projecting information data belonging to a 
multidimensional space into a space having less 
dimensions comprising the following steps; 
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Providing a database of N-dimensional data in the 
form o£ records having a certain nuniber o£ variables. 

Defining a metric function for calculating a 
distance between each record of the database. 
5 Calculating a matrix of distances between each 

record of the database by means of the metric function 
defined at the previous step 

Defining a n-1 dimensional space in which each 
record is defined by n-1 coordinates. 
10 Calculating the n-1 coordinates of each record in 

the n-1 dimensional space by means of an evolutionary 
algorithm; 

Defining as the best projection of the records 
onto the n-1 dimensional space the projection in which 
IS the distance matrix of the records in the n-1 
dimensional space best fits or has minimum differences 
with the distance matrix of the records calculated in 
the n-dimensional space. 

48. An i^paratus according to claim 47^ 
20 characterised in that a database is provided in which 

the distances between the records are already 
contained. 

49. An apparatus according to claims 47 or A8f 
characterised in that as an evolutionaxry algorithm a so 

25 called genetic algorithms is used. 

50. An apparatus according to one or more of the 
preceding claims 47 to 49, characterised in that the 
subroutine provides for the following steps encoding 
each individual record or variable represented by a 

30 point having coordinate X and Y; 

defining a set of different X and Y coordinates 
for each point forming a first population of 
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projections solution onto the less dimensional space ^ 
usually a two or three dimensional space. 

Calculating the fitness score for each of the 
projections of this first population by using as the 
5 fitness function the matrix of distances of the single 
points in the originally N dimensional space; 

Subjecting the population of projections to 
combination according to certain combination rules thus 
producing a first generation population of projections 
10 which comprises X and Y coordinates for the points 
which are a condbination of the coordinates provided in 
two projections of the parent generation; 

Calculating the fitness score of the projections 
of the first generation and forming again a new 
15 generation basing on the first generation. 

51. An apparatus according to one or more of the 
preceding claims 47 to 50, characterised in that the 
genetic algorithm is the so called GenD algorithm. 

52. An apparatus according to one or more of the 
20 preceding claims 47 to 51, characterised in that a 

hidden point can be defined^ which corresponds to a 
hidden record or a to a hidden variable and whose 
existence is only guessed, the said hidden point is 
added in the parent population by giving to it position 
25 coordinates and Yhi in the projection. 

53. An apparatus according to one or more of the 
preceding claims 47 to 52, characterised in that the 
calculation of the evolutionary algorithm is carried 
out in parallel with the hidden point and without the 

30 hidden point and the best fit projections obtained by 
the two parallel calculations is conqpared. 

54- An apparatus according to one or more of the 
preceding claims 47 to 53, characterised by the further 
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steps of providing a database con^rising a certain 
number of records each one characterised by a certain 
number of variables, 

elaborating the djatabase alternatively or in 
S parallel according to two ways: 

a first way by which the records are considered 
as being points and the variables as being the 
coordinates of the points. 

A second way by which the variables are 
10 considered as being points and the records are the 
coordinates • 

55. An apparatus according to one or more of the 
preceding claims 47 to 54, characterised in that it 
comprises a further different treatment of the data as 

15 a pre or post processing phase. 

56. An apparatus according to one or more of the 
preceding claims 55 characterised in that the data is 
processed in a preventive step by means of a Self 
Organising Map algorithm, the clusters formed by this 

20 algorithm in the different units being then subjected 
to treatment according to one or more of the proceeding 
claims 47 to 55. 

57. An apparatus according to one or more of the 
preceding claims, characterised in that a relationship 

25 between the collected data records of the collected 
data is determined by means of the distance of each 
data record from the other data records and the said 
distance being used as a relevance weight of each data 
record in determining the activation or deactivation 

30 of one or more of the means for carrying out 
mechanical, physical and or chemical actions. 

58- An apparatus according to claim 57, 
characterised in that a maximum distance for each data 



wo 2005/008596 



56 



PCT/EP2004/051190 



record from the other is set for discriminating the 
data records to be used in detemdning the activation 
or deactivation of one or more of the means for 
carrying out mechanical, physical and or chemical 
5 actions . 

59. An apparatus according to claims 56 or 57, 
characterised in that the clustering or distance of the 
data records on the map on which the database has been 
projected is used as a measure of similarity of the 

10 data records or of the variables related to the said 
data records. 

60 . A Method for providing an apparatus having 
artificial intelligence with a simulation of intuitive 
behaviour, the said aj^aratus comprising: 

15 A processing unit (1) , the said processing unit 

being connected to a data memory (3) and to a program 
memory (2) ; 

The processing unit being further connected to one 
or more different sensors for detecting or measuring 
20 different physical and/or chemical conditions or 
effects or processes characterising or occurring in the 
environment ; 

The processing unit further being connected to 
data input means by means of a service person or a data 
25 input line from other data collecting apparati; 

The processing unit being also connected to means 
for carrying out mechanical, physical or chemical 
actions, such as actuators or the like; 

in the program memory a program executable by the 
30 processing unit being loaded which program has a 
routine for driving the sensors and saving in a 
univoquely recognizable way each data collected by the 
sensors and or for saving data inputted by a service 
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person or by other apparati, driver for activating or 
deactivating the means for carrying out mechanical, 
physical or chemical actions, such as actuators or the 
like; 

S the program stored in the program memory further 

conqprising means for evaluating the data collected by 
the sensors and/or the data inputted by a service 
person or by other apparati; 

The method being characterised by the following 
10 steps : 

generating from the collected and/or inputted data 
a database with a certain number of records each one 
comprising a certain number of variables and which are 
relative to a N-dimensional space; 

15 projecting the database considering the record as 

points and the variables as coordinates or the 
variables as points and the records as coordinates onto 
a space having a reduced number of dimension relatively 
to the space N dimensional space; 

20 the projection being carried out by means of an 

Algorithm for projecting information data belonging to 
a multidimensional space into a space having less 
dimensions comprising the following steps: 

Calculating a matrix of distances between each 

25 point defined by a record or a variable of the database 
by means of a metric function; 

Defining a n-1 dimensional space in which each 
point represented by a record or a variable is defined 
by n-1 coordinates. 

30 Calculating the n-1 coordinates of each point in 

the n-1 dimensional space by means of an evolutionary 
algorithm; 
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Defining as the best projection of the points onto 
the n-1 dimensional space the projection in which the 
distance matrix of the points in the n-1 dimensional 
space best fits or has minimum differences with the 
5 distance matrix of the points calculated in the n- 
dimensional space. 

61. A method according to claim 60, characterised 
in that a database is provided in which the distances 
between the records are already contained. 
10 62. A method according to claims 60 or 61, 

characterised in that as an evolutionary algorithm a so 
called genetic algorithm is used. 

63 . A method according to one or more of the 
preceding claims 60 to 62, characterised by an 
15 algorithm executing the following steps: 

encoding each individual record or variable 
represented by a point having coordinate X and Y; 

defining a set of different X and Y coordinates 
for each point forming a first population of 
20 projections solution onto the less dimensional space, 
usually a two or three dimensional space. 

Calculating the fitness score for each of the 
projections of this first population by using as the 
fitness function the matrix of distances of the single 
25 points in the originally N dimensional space; 

Subjecting the population of projections to 
combination according to certain conibination rules thus 
producing a first generation population of projections 
which comprises X and Y coordinates for the points 
30 which are a coinblnation of the coordinates provided in 
two projections of the parent generation; 
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Calculating the fitness score of the projections 
of the first generation and forming again a new 
generation basing on the first generation. 

64. A method according to one or more of the 
5 preceding claims 60 to 63, characterised in that the 

evolutionary algorithm is the so called GenD algorithm. 

65. A method according to one or more of the 
preceding claims 60 to 64, characterised in that a 
hidden point represented by a hidden record or a hidden 

10 variable can be defined, which corresponds to a hidden 
point on the map and whose existence is only guessed, 
the said hidden point being added in the parent 
population by giving to its position coordinates Xhi and 
Yhi in the projection. 
15 66. A method according to claim 65, characterised 

in that the calculation of the evolutionary algorithm 
is carried out in parallel with the hidden point and 
without the hidden point and the best fit projections 
obtained by the two parallel calculations is compared. 
20 67. A method according to one or more of the 

preceding claims 60 to 66 characterised by the further 
steps of providing a database comprising a certain 
number of records each one characterised by a certain 
nuxnber of variables, 
25 elaborating the database alternatively or in 

parallel according to two ways: 

a first way by which the records are considerd as 
being points and the variables as being the coordinates 
of the points. 

30 A second way by which the variables are 

considered as being points and the records are the 
coordinates . 
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68 . & method according to one or more of the 
preceding claims 60 to 67 , comprising a further 
different algorithm treating the database as a pre or 
post processing phase. 
5 69 A method according to claim 68, characterised 

in that the database is processed in a preventive stage 
by means of a Self Organising Nap algorithm, the 
clusters formed by this algorithm in the different 
units being subjected to a projection by means of the 
10 algorithm according to one or more of the proceeding 
claims. 

70 • A method according to one or more of the 
preceding claims 60 to 70, characterised in that the 
clustering or distance of the points on the map on 

15 which the database has been projected is used as a 
measure of similarity of the records or of the 
variables related to the said point. 

71. A method according to one or more of the 
preceding claims, characterised in that a database 

20 comprising a certain number of records each one being 
related to a certain nuinber of variables is provided, 
to the said database being further added the 
complementary variables to the variables originally 
provided and the said integrated database being 

25 subjected to projection onto a less dimensional space, 
particularly on a two or three dimensional space; 

The distance in the map between each variable and 
its conqplementary variable being used as a measure for 
the relevance of said variable in the database. 

30 



